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Human immunodeficiency virus type I (HIV-1), a causative agent of AIDS, is affecting today more than 35 millions of people 
worldwide. The advance of anti-HIV chemotherapy has made AIDS a chronic non-fatal disease in resourceful countries. Long- 
awaited anti-HIV-1 vaccine is still not with us yet; however, great progress in structural analyses of the envelope protein of HIV-1 


in recent years starts to shed light on rational intervention targeted at the envelope protein, as will be reviewed in this article. 
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It has been more than 30 years now since the discovery of 
human immunodeficiency virus type I (HIV-1) as a causa- 
tive agent of AIDS. During the past three decades, re- 
searchers have elucidated the complex nature of this unique 
retrovirus. Significant advance in chemotherapy has made 
HIV-1 infection as a chronic manageable disease in devel- 
oped countries. Another arm of intervention, a vaccine 
against HIV-1, is unfortunately not with us yet [1,2]. How- 
ever, our understanding of the structure of the envelope 
protein (Env) of HIV-1, a major target of vaccine, is signif- 
icantly advanced in recent years. In addition to the X-ray 
crystallographic study, the cryo-electron microscopy (Cry- 
oEM) technology is helping us grasp the native structure of 
HIV-1 Env. Now we are able to describe the mechanism of 
membrane fusion and antibody-mediated neutralization at 
the molecular level. Here I will review some of recent ad- 
vance of the structural analysis of HIV-1 Env. 


1 Virology of HIV-1 
Before getting into the discussion of the structure of Env, I 
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will briefly describe the virological characteristics of HIV-1. 
HIV-1 is a complex retrovirus belonging to the lentivirus 
family. HIV-1 is not an oncogenic retrovirus, but induces 
immunodeficiency. Common to other replication competent 
retroviruses, HIV-1 has three structural genes encoding Gag 
(group specific antigen), Pol (polymerase), and Env. Be- 
sides these structural genes, HIV-1 has six additional genes. 
Two genes, fat and rev, are classified as essential genes and 
play a critical role in transcription and transport of mRNAs, 
respectively [3]. Other four genes are called accessory 
genes because they are dispensable for viral replication in 
T-cell lines in vitro. However, accessory gene products are 
needed to modify the host environment to allow efficient 
virus replication in vivo. Many accessory gene products 
work as adaptors to destroy host restriction factors. Due to 
the limitation of its small genome size, the virus seems to 
target host factors whose presence is very detrimental to its 
replication. The initial study of Vif led to the identification 
of an APOBEC family as a major restriction factor [4]. 
Subsequent studies of Vpu, Vpr, Nef, and finally Vpx of the 
related virus HIV-2/SIV identified respective restrictive 
host factors [5]. These studies are tightly linked with the 
cell biology, and the researchers in the field are now more 
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cautious about the choice of cell lines they use, since some 
of the cellular restriction factors are abundant in a particular 
cell line and not in others. 


2 Host immune responses and viral mutations 


The above-mentioned accessory gene products deal with the 
host innate immunity. HIV-1 copes with the dynamic ac- 
quired immunity, both humoral and cellular immunities, by 
mutations. The robust HIV-1 replication mediated by the 
error- and recombination-prone reverse transcriptase con- 
tributes to them. Shortly after the infection, HIV-1 estab- 
lishes quasispecies that adapt to the pressure of the immune 
responses [6]. Being on the surfaces of virions and infected 
cells, Env is a major target of the host immune responses. 
The diversity of Env sequence is enormous and antibody 
responses usually cannot catch up with the mutations of Env. 
This is further worsened because HIV-1 targets the key 
player of the host immune system, CD4 positive T cells, and 
induces their death or dysfunction [7]. Most of the HIV-1 
infected individuals would succumb to death without an 
appropriate intervening therapy. However, there are some 
people who can manage HIV-1 infection. Such people 
called elite controllers can control HIV-1 replication and 
stay disease-free without the anti HIV-1 chemotherapy. 
Analyses of these elite controllers, especially elite neutral- 
izers, would lead to the identification of broadly neutraliz- 
ing antibodies. 

There are only handful of classes of broadly neutralizing 
antibodies (bnAbs) [8,9]. The breadth of the neutralizing 
activity of a particular bnAb is evaluated from the efficien- 
cy of its neutralization of different subtypes of HIV-1. The 
subtypes of HIV-1 are called clades. Clades are based on 
the nucleotide and amino acid sequences, and the sequences 
can differ from clade to clade by about 20%—-30%. The 
bnAbs usually manifest cross-clade neutralization. One 
should be aware that a clade is not necessarily directly 
linked with any biological phenotypes and should not be 
confused with serotypes in other viruses. 


3 Envelope protein of HIV-1 


The target of the bnAbs is the Env protein of HIV-1. HIV-1 
Env is a heavily glycosylated membrane protein composed 
of the heterodimer of gp120 and gp41. These two subunits 
are generated from a precursor protein called gp160 by pro- 
teolytic processing by cellular proteases. The mature HIV-1 
Env is a trimer of these heterodimers. The gp120 subunit is 
responsible for receptor/coreceptor recognition. The recep- 
tor of HIV-1 Env is CD4 and there are two major corecep- 
tors for HIV-1, namely CXCR4 and CCR5 [10]. Upon the 
interaction between the gp120 and receptor/coreceptor, 
gp120 will undergo the conformational changes and this, in 
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turn, is believed to trigger the cascade of conformational 
changes of the associated gp41 subunit, which plays a major 
role in induction of membrane fusion. The gp41 molecule 
belongs to a class I fusion protein like HA2 of influenza 
virus [11]. The class I fusion protein is composed of trimer 
of mainly helix proteins and forms a quaternary structure 
called six-helix bundle (6HB) during the membrane fusion. 
The atomic structures of various 6HB have been determined 
[12]. For the long time, the only available structures for 
gp41 were those of 6HB. 


4 Structure of HIV-1 Env 


The gp120 subunit, being heavily glycosylated (nearly the 
half of the mass is from sugar moieties) and with five highly 
variable flexible loops, has been a difficult target for the 
structural analysis. The first seminal structure was deter- 
mined for the engineered monomeric gp120, in which the 
most of the variable loops were replaced with short linker 
sequences. Furthermore, the engineered gp120 was stabi- 
lized by mixing it with the fragment of CD4 and a mono- 
clonal antibody, 176 [13]. Since then, several structures of 
gp120, both in the free form or complex with other ligands, 
have been determined. However, the quaternary structure 
containing both gp120 and gp41 in the trimeric form has 
been resistant to crystallization. Meanwhile, CryoEM start- 
ed to provide the quaternary structure of Env [14—17] in low 
resolution. Applying both crystallography and CryoEM 
approaches, the group at Scripps Institute revealed the over- 
all structure of gp120/gp41 complex in a relatively high 
resolution in 2013 [18,19]. This is another landmark 
achievement in the field. 

The HIV-1 Env protein used for both CryoEM and crys- 
tallographic analyses was derived from the BG505 strain 
belonging to clade A. The protein was truncated at residue 
664 in the beginning of membrane proximal external region 
(MPER), and remains as a soluble non-membrane anchored 
form. Several modifications were introduced to facilitate 
structural analyses: (i) gp120 and gp41 are connected by an 
artificial disulfide bond (between residues 501 of gp120 and 
605 of gp41, termed SOS) to avoid shedding of gp120, (ii) 
the isoleucine residue at 559 of gp41 was mutated to proline 
(termed IP) to stabilize gp41-gp41 association. These two 
modifications have been shown not to affect the overall 
structures of gpl20. Thus modified HIV-1 Env called 
BG505 SOSIP.664 was more thermostable than the native 
Env. In addition to these modifications, the engineered Env 
was stabilized by complexing with neutralizing antibodies. 

The overall structure of gp120 subunit of BG505 
SOSIP.664 is similar to the previously determined structure. 
The structural information on variable loops often missed in 
the previous crystal structure was provided except for V4. 
V1/V2 and V3 loops are located in the central apical region 
of the gp120 trimer. V5 and V4 loop are located to the pe- 
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ripheral region of the gp120 trimer. The part of the V3 loop 
was located underneath the V1/V2 loop (Figure 1). 
Unfortunately the resolution of gp41 portion was not 
high enough to allow the assignment of individual amino 
acid residues in gp41. However, both the N-terminal heptad 
repeat (NHR) and C-terminal heptad repeat (CHR) were 
resolved as separated helices. NHR stood perpendicular to 
the viral membrane and CHR lying horizontally under the 
NHR trimers. The C-terminal half of NHR forms a trimeric 
helix bundle-like structure. The similar trimeric he- 
lix-bundle of NHR is observed in the post fusion six-helix 
bundle (6HB) structure, but the revealed three-helix bundle 
most likely represents the prefusion state rather than acti- 
vated intermediate. The N-terminal region of gp41 contain- 
ing fusion peptide (FP) and the connection to the NHR re- 
gion was not resolved. The CHRs located near the bottom 
of the NHR bundle are wrapping around the base of NHR 
trimer to form a triangular shape. It was shown that the 
middle portion of CHR has a strong interaction with the 
bottom of NHR. This structure denies the presence of pre- 
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formed 6HB in the native HIV-1 Env as proposed before 
[20], and suggests the complex conformational rearrange- 
ments during the formation of 6HB. 


5 Further update of HIV-1 Env structural and 
its implication for membrane fusion 


The molecular details of HIV-1 Env-mediated membrane 
fusion have been elusive. The above mentioned architec- 
tures of gp120 and gp41 clarify some aspects of structural 
rearrangement of the gp120/gp41 during membrane fusion. 
In the prefusion state, the exterior hydrophobic surface of 
the NHR is covered by the surrounding inner domain of 
gp120. This surface of the NHR bundle will eventually in- 
teract with the CHR to form the six-helix bundle. The re- 
ceptor binding will induce positional shift of gp120 from 
the closed to open state; this allows the exterior surface of 
NHR to interact with the corresponding region of CHR. 
How this association of NHR and CHR is exactly achieved 


Figure 1 Structure of BG505 SOSIP.664 determined by CryoEM (PDB 4NCO). Note that bound PGV04 bnAb is not shown to show HIV-1 Env region 
clearly. A, Side view. Main chains of two gp120 subunits are shown in gold. V1V2 and V3 loops are shown in green and magenta, respectively. They are in 
the apical center of trimer complex. Three gp41 subunits are shown in blue, red, and yellow. For the clear view of the gp41 bundle, the gp120 subunit corre- 
sponding to blue gp41 is removed. NHR and CHR helices are indicated. The kink in NHR is shown with an asterisk. B, Top view corresponding to the panel 
A is shown. The gp120 subunit removed in the panel A is shown in cyan. The V1V2, V3 are shown in the same color as in A. Dashed lines indicate the 
position of the missing V4 loops. C, The bottom view (seen from the viral membrane). Three CHRs of gp41 form a triangle at the base of the NHR bundle. 
The positions of the kink in CHR are shown with the asterisks. D, The structure of six-helix bundle (6HB) of gp41 (PDB1AIK) is shown for a reference. In 
the 6HB, CHR is tightly docked into the grooves generated by the central bundle of NHR. NHR and CHR of the blue subunit are indicated. The images were 


generated by the UCSF Chimera program. 
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is the interest of the researchers like us who are interested in 
the mechanism of membrane fusion. Molecular insight of 
these conformational changes are finally revealed by the 
Group of P. Kwong from National Institute of Allergy and 
Infectious Diseases [21] with the revelation of gp120/gp41 
structure with a higher resolution in 2014. They worked on 
the same BG505 SOSIP 664 envelope protein, but they 
added the second bnAb (35022) recognizing the 
gp120/gp41 boundary region. This improved the resolution 
of the crystal structure and assignment of the amino acid 
residues of gp41 in prefusion status was provided for the 
first time. This landmark work provided the model for the 
possible structural rearrangements of gp41 during mem- 
brane fusion. By the comparison of the new structure with 
the previously determined 6HB structure, a region of gp41 
was shown to undergo a structural change from the 
non-helical state to the helical structure during membrane 
fusion. This structural rearrangement is similar to the pre- 
viously shown spring-loading model in influenza HA2 dur- 
ing membrane fusion. Here a common structural rear- 
rangement for both class I fusion proteins, gp41 and HA2, 
was established. This model will guide us to further under- 
stand the molecular basis for the detailed membrane fusion 
mechanism. 

We have been interested in the structure and function of 
gp41. Different from another report of nearly full length 
HIV-1 Env [22], the current structure of BG505 SOSIP.644 
misses the structural information on the MPER, mem- 
brane-spanning domain (MSD) and cytoplasmic region of 
gp41. MPER is one of the major targets of the bnAbs and 
MSD clearly plays a critical role in biological function of 
HIV-1 Env [23]. Although it would be a challenging task, 
further elucidation of the structure of full length Env in- 
cluding MSD and cytoplasmic tail together with lipid bi- 
layers is highly anticipated. 


6 Broadly neutralizing antibodies and vaccine 


Determination of the structure of the complex between 
SOSIP and bnAbs provides us the molecular basis for the 
mechanism of neutralization by bnAbs. Since not so much 
molecular information is available on the vast majority of 
other viruses, this understanding of neutralization at the 
molecular level is somehow unique for HIV-1. Several dif- 
ferent categories of bnAbs against HIV-1 have been identi- 
fied [8]. The bnAb directed to the receptor/coreceptor bind- 
ing sites of gp120 represents one category and its mecha- 
nism of neutralization is relatively straightforward: it inter- 
feres with the interaction between the receptor/coreceptor 
and Env, and blocks the subsequent conformational changes 
of Env. Other categories of bnAbs bind to the different re- 
gions of Env and they are now shown to interfere with the 
structural changes of Env necessary for membrane fusion. 
These may include bnAns binding to (i) V1/V2 loops of 
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Env with associated glycans, (ii) the apex of gp120 with 
association with attached glycans, (iii) gp41 MPER. The 
most recently identified bnAb targets the gp120/gp41 
cleavage site [24,25]. 

Glycans are once regarded as an obstacle for the devel- 
opment of effective antibody responses, since it is rather 
immunologically inert to the host. When we learn the fact 
that the interactions of many bnAbs and HIV-1 Env depend 
on the presence of glycans, we may have to modify our 
view. Probably some of the glycan is essential for the struc- 
tural integrity of HIV-1 Env while others are placed as in- 
terfering barriers against antibodies. Recent progress in the 
technologies now allows us to examine types of different 
sugar moieties attached to Env. One interesting insight ob- 
tained through these detailed structural information on 
Env-bnAb complexes is that not only the target region but 
the way the particular antibody binds determine the out- 
come. Two antibodies bound to the very close regions may 
become neutralizing or non-neutralizing depending on the 
direction of the antibody binding. 

As discussed above, we now have much better under- 
standing of the structures of native Env itself and those of 
Env-bnAb complexes. The next remaining issue is how to 
induce such bnAbs responses in humans. A recent report by 
Doria-Rose et al. [26] showed the development pathway for 
potent V1/V2- directed neutralizing antibodies in an HIV-1 
infected individual. This study involves a sophisticated in 
vitro clonal analysis of antibody producing B cells derived 
from elite neutralizers in combination with structural anal- 
yses of antibodies. In the body of this person, there is clear 
evolution of potent bnAbs directed against HIV-1. To our 
surprise, this evolution took place in relatively short time. 
The question is how general this kind of observation is and 
how to elicit this kind of Ab by man-made immunogens. It 
is interesting to see how the research pathway in this field 
will evolve in future. 
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